Valuing search and communication in partially-observable coordination problems
نویسندگان
چکیده
+ Bayesian coordination games # + # $ . # # C # C . $ < # , # # " C " # # ,
منابع مشابه
A POMDP Framework to Find Optimal Inspection and Maintenance Policies via Availability and Profit Maximization for Manufacturing Systems
Maintenance can be the factor of either increasing or decreasing system's availability, so it is valuable work to evaluate a maintenance policy from cost and availability point of view, simultaneously and according to decision maker's priorities. This study proposes a Partially Observable Markov Decision Process (POMDP) framework for a partially observable and stochastically deteriorating syste...
متن کاملDecentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions
This work focuses on solving general multi-robot planning problems in continuous spaces with partial observability given a high-level domain description. Decentralized Partially Observable Markov Decision Processes (DecPOMDPs) are general models for multi-robot coordination problems. However, representing and solving DecPOMDPs is often intractable for large problems. This work extends the Dec-P...
متن کاملThe Size of Message Set Needed for the Optimal Communication Policy
Communication is a key for facilitating multi-agent coordination on cooperative problems. In our previous work, we proposed Signal Learning (SL) and Signal Learning with Messages (SLM) by which agents learn local policies of communication and action simultaneously in MultiAgent Reinforcement Learning (MARL) framework. Our experimental results showed that both SL and SLM can improve the performa...
متن کاملSidekick agents for sequential planning problems
Effective Al sidekicks must solve the interlinked problems of understanding what their human collaborator's intentions are and planning actions to support them. This thesis explores a range of approximate but tractable approaches to planning for AI sidekicks based on decision-theoretic methods that reason about how the sidekick's actions will effect their beliefs about unobservable states of th...
متن کاملDecentralized Decision-Making Under Uncertainty for Multi-Robot Teams
Automatically generating solutions to general multi-robot coordination problems with communication limitations is challenging, but crucial in many domains. As one way to address this problem, we describe a probabilistic framework for synthesizing control policies for general multi-robot systems based on decentralized partially observable Markov decision processes with macro-actions (MacDec-POMD...
متن کامل